Picture for Amy Zhang

Amy Zhang

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

Add code
May 25, 2026
Viaarxiv icon

Reinforcement Learning via Value Gradient Flow

Add code
Apr 15, 2026
Viaarxiv icon

Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces

Add code
Apr 13, 2026
Viaarxiv icon

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Add code
Mar 17, 2026
Viaarxiv icon

Regularized Latent Dynamics Prediction is a Strong Baseline For Behavioral Foundation Models

Add code
Mar 16, 2026
Viaarxiv icon

A Recipe for Stable Offline Multi-agent Reinforcement Learning

Add code
Mar 09, 2026
Viaarxiv icon

Factored Latent Action World Models

Add code
Feb 18, 2026
Viaarxiv icon

Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

Add code
Feb 12, 2026
Viaarxiv icon

Hierarchical Entity-centric Reinforcement Learning with Factored Subgoal Diffusion

Add code
Feb 02, 2026
Viaarxiv icon

Learning Robust Reasoning through Guided Adversarial Self-Play

Add code
Jan 30, 2026
Viaarxiv icon